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The compartmental models used to study epidemic spreading often assume the same susceptibility for all 
individuals, and are therefore, agnostic about the effects that differences in susceptibility can have on 
epidemic spreading. Here we show that-for the SIS model-differential susceptibility can make networks 
more vulnerable to the spread of diseases when the correlation between a node's degree and susceptibility are 
positive, and less vulnerable when this correlation is negative. Moreover, we show that networks become 
more likely to contain a pocket of infection when individuals are more likely to connect with others that have 
similar susceptibility (the network is segregated). These results show that the failure to include differential 
susceptibility to epidemic models can lead to a systematic over/under estimation of fundamental epidemic 
parameters when the structure of the networks is not independent from the susceptibility of the nodes or 
when there are correlations between the susceptibility of connected individuals. 



The contact networks that underlie the spread of diseases, behaviors 1 and ideas have heterogeneous topol- 
ogies 2 4 , but also, exhibit heterogeneity in the susceptibility of individuals 711 . In recent years the compart- 
mental models used to study the spread of behaviors have been generalized to include the effects of network 
topology, but not the effects of both topology and differences in the susceptibility of individuals 4,12,13 . 
Generalizations including the network topology have revealed an intimate connection between the spectral 
properties of the contact network, and the basic reproductive number of infectious diseases, showing that for 
a network described by an arbitrary degree distribution the basic reproductive number of an infection CR 0 )> is 
proportional to the largest eigenvalue of the contact network's adjacency matrix 13 . For highly heterogeneous 
networks, this eigenvalue is always larger than 1 meaning that network heterogeneity can reduce and even 
eliminate the existence of an epidemic threshold. 

Our understanding of the role of heterogeneous network topologies in epidemic spreading, however, has not 
been matched by a comparable development in our understanding of the role of heterogeneity in the susceptibility 
of individuals. Yet, differential susceptibility, defined as the variation in the susceptibility of individuals is as 
widespread as network heterogeneity. For example, genetic conditions are known to cause heterogeneous reac- 
tions to HIV 7,8 , H5N1 influenza 9 , and the Encephalomyocarditis virus 10 . Differential susceptibility can also be the 
result of differences in age as it has been shown in the case of Hantaan Virus in mice 11 . Other mechanisms leading 
to differential susceptibility include previous disease history, obesity, stress, history of drug abuse, physical 
trauma or differences in healthcare quality, which could emerge from discriminatory practices or individual 
self-selection. The biological prevalence of differential susceptibility, therefore, invites us to ask whether relaxing 
the assumptions of homogeneous susceptibility has consequences for the spread of epidemics that are tantamount 
to the relaxation of assumptions of homogeneity in the connectivity of the contact network. 

The incorporation of differential susceptibility into epidemic models, however, also introduces a new dimen- 
sion to epidemic modeling, since there are multiple ways for individuals with differences in susceptibility to be 
arranged in a network. For instance, the mixing patterns and segregation of populations 14 imply that differential 
susceptibility can be structured through non-trivial correlations. Examples here include schools, nursing homes 
and hospitals, where children, senior citizens and patients, who can be more susceptible to diseases, spend more 
time together. These mixing patterns imply that a complete study of differential susceptibility should consider not 
only variations in the susceptibility of individuals, but also the correlations between the susceptibility of indivi- 
duals and the positions these occupy in a network. 
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Understanding how epidemic spreading is affected by differential 
susceptibility can affect a number of policy decisions, since epidemic 
models do not only inform the spread of infectious diseases, but also 
the spread of behaviors 15 , such as smoking 16 ; health conditions, such 
as obesity 1 ; and digital threats, such as computer and mobile phone 
viruses 17 . Here, however, we show that the failure to consider differ- 
ences in the susceptibility of individuals can lead to over- or under- 
estimate a network's vulnerability to epidemic spreading. 

In this paper we solve the SIS epidemic model for a contact net- 
work with arbitrary network topology and differential susceptibility 
and show that its basic reproductive number R 0 is proportional to the 
maximal eigenvalue of a topology-infection matrix that combines 
information on the topology of the network and the susceptibility 
of individuals. Using mean-field theory, we look at individual level 
correlations between the susceptibility and the degree or connectivity 
of individuals and show that positive correlations between suscept- 
ibility and degree makes the network more vulnerable to epidemics 
(increasing R 0 ), whereas negative correlations make the network less 
vulnerable (decrease R 0 ). Finally we look at segregation by studying 
the consequences of having individuals connected to other indivi- 
duals with similar characteristics and show that segregation signifi- 
cantly increases the vulnerability of the network to disease - causing 
R 0 to increase. To conclude, we illustrate the strong effects of segrega- 
tion dynamics on R 0 by running a variant of Schelling's segregation 
process 1 ' on a real-world contact network obtained from face-to-face 
proximity between students and teachers. This shows then even a 
mild level of segregation can drastically increase the critical repro- 
ductive number in a network where individuals differ in their 
susceptibility. 

Results 

The basic reproductive number R 0 in the SIS model. We begin by 
summarizing the main results for the SIS model in heterogeneous 
networks without differential susceptibility. This will help us 
introduce the methodology and notation that we will use later, and 
will also help us compare known results with those obtained for 
networks when differential susceptibility is present. 

In the SIS model individuals can exist in either of two possible 
states: "healthy" or "infected". Healthy individuals are infected when 
they come into contact with an infected individual with probability 
Infected individuals, on the other hand, become once again suscept- 
ible with a recovery probability S. When each individual is in contact 
with k others, the basic reproductive number of a homogeneous 
network takes the form: 



R»=^ 



d 



where the superscript h is used to indicate a homogeneous network. 
Rq can be interpreted as the average number of new infections that an 
infected individual generates during his infective period in a fully 
susceptible population. _R 0 is the quintessential epidemiological para- 
meter, since the infection can only spread when an infected indi- 
vidual gives rise to one or more new infected (_R 0 > 1 ) • Because of this, 
a central question in epidemiology is under what conditions R 0 
becomes greater than 1. 

In degree heterogeneous networks with no degree-degree correla- 
tions and a degree distribution given by P{k) the basic reproductive 
number R 0 generalizes to 3,4 



nunc 
K 0 '- 



fig) 

8 (k) 



=SZ 1 



(2) 



where the superscript unc stands for uncorrelated networks, (k) is the 
average degree of the contact network, (k 2 ) is the average of the 
degrees squared and is the standard deviation of the degree dis- 
tribution. Note that when there is no heterogeneity, equation (2) 



reduces to equation (1), otherwise R^ nL > _R 0 ! since (k 2 ) > (k) 2 . In fact, 
for highly heterogeneous networks, where the degree distribution 
follows a power-law P(k) oc k~" with a < 3, (and therefore R 0 ) 
grows with the network size, implying that the epidemic threshold 
vanishes for infinitely large networks (_R 0 is always larger than 1 ) . It is 
worth noting that this result was obtained using the heterogeneous 
mean-field (HMF) theory, which neglects both dynamical and topo- 
logical correlations. In HMF theory, the actual quenched structure of 
the network given by its adjacency matrix A y - is replaced by an 
annealed version, in which edges are constantly rewired at a rate 
much faster than that of the epidemics, while preserving the degree 
distribution P(k) 5 . 

In most real world networks nodes are not connected randomly to 
other nodes, and this effect is characterized by what is known as non- 
trivial mixing patterns 14 . In the case of degree heterogeneity these 
mixing patterns are captured by the conditional probability P(k'\k) 
that a link starting at a node with degree k will end at a node with 
degree k' An this case, the basic reproductive number takes the form 12 



ncorr 
K 0 



-- d h, c 



(3) 



where Ai c is the largest eigenvalue of the degree mixing matrix C^- 
= kP(k'\k). 

While HMF theory assumes annealed networks, its validity for real 
(quenched) networks is limited 6 . Quenched networks are static, 
given by the adjacency matrix A, where a» = 1 if there is a link 
connecting node i to node j and 0 otherwise. A significant improve- 
ment over the HMF theory is given by the quenched mean-field 
(QMF) theory where it has been shown that the basic reproductive 
number takes the form 13 : 



R 



QMF 



P. 

= s AlA 



(4) 



where A liA is the largest eigenvalue of the matrix A. 



Differential susceptibility. Next, we introduce differential suscep- 
tibility by assuming that the susceptibility probability /?, is different 
for each node i in the network. To further clarity, we consider fS t in 
terms of the probability that a susceptible individual i gets infected 
when in contact with an infected individual. 

After taking this consideration into account we proceed by follow- 
ing Wang et al. 13 and note that for an arbitrary network topology 
given by an irreducible non-negative adjacency matrix A = [fly];vxN' 
the evolution of the SIS model can be written following as: 

p i (t+l)^(l-p,(t))f,(t) + (l-8)p,(t) (5) 

where p t {t) is the expected probability that node i will be infected at 
time r, and /(f) is the probability that node i receives the infection 
from at least one of its infected neighbors at time t. The probability 
fi(t) has the form: 



/ i (t) = i-.n(i-Aa^(t)). 



(6) 



Equation (5) is a non-linear dynamical system in which the case of no 
infection in the network (p, = 0, i € {1, N}) represents a fixed 
point of the system that becomes unstable when _R 0 > 1. To test for 
asymptotic stability, we linearize the system (5) around p = 0 by 
removing all higher-order terms of p,-(r): 

N N 

Pi {t+l) = (l-d) Pi (t) + ]T Pi^pjit) = «s»W- ( 7 ) 

;=1 ;=1 

We note that here we have assumed no differences in the recover- 
ability rate of individuals (i.e. delta is constant). Next, we express (7) 
in matrix form as: 
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p(t+\)=Mp(t) 



(8) 



where M = [my], = Pfiu + Ay(l — S) and A,y is the Kronecker 
delta. Thus, one gets that the state with no disease is asymptotically 
stable if the largest eigenvalue of M, A lM < I. 

We note that when the origin is asymptotically stable, it is also 
globally stable, since 



a -moi/ko </*(*)<£/*«« 



(9) 



where the second inequality follows directly from the Weierstrass 
product inequality. Using this, we see that the system satisfies the 
inequality 



p(f)<Mp(f-l)<M f p(0). 



(10) 



Thus, when a 1 M < 1 , the infection will die out exponentially fast with 
a rate determined by ?.\ i m- 

Solutions with incomplete information. The connection between 
the largest eigenvalue of M (Xija) and the epidemic threshold 
represents a solution of the system that has little practical use in 
absence of complete information about the network topology and 
the susceptibility of individuals. For the model to be of practical use 
we need to estimate A 1M when there is incomplete information about 
the topology of the network and/or the distributions of susceptibility 
probabilities of individuals. First, we note that the matrix M = _R + (1 
— 8)1 where _R = [r,j], r t j = /?,a« and / is the identity matrix. 
Therefore, we can define the threshold at which epidemics begin to 
spread through the largest eigenvalue of R and generalize R 0 in 
equation (4) to: 



(ii) 



To estimate when there is incomplete information about the 
system, we assume that r,j is a random variable following an 
arbitrary distribution and use a mean-field approximation ry ~ 
(r«) where {r {J ) is the expected value of r i} over all possible network 
realizations. In a network where the susceptibility is assigned 
independently of the topology, we have that (rJa^) = (j6|fly) fly = 
(fS) a f j where (x\y) is the expected value of x given y. Then we obtain 
the basic reproductive number 



iC = 0^i,A + l-<5. 



12 



In the case of uncorrelated networks with heterogeneous degrees, we 
relax (r«) to the expected number of links between nodes i and j, 
which is proportional to the product of the degrees of i and j. Keeping 
arbitrary susceptibility probabilities and degrees, we have 

kikj 



N(k) 



13 



Since _R is an irreducible matrix, the eigenvector v associated with the 
maximal eigenvalue Ajjj is strictly positive. Additionally, there are no 
other positive eigenvectors except positive multiples of v. In this case, 
using the positive eigenvector v = [ v,] with v,- = kfi t , one immediately 
obtains the maximal eigenvalue, and therefore the basic reproductive 
number 



rttlllC 

K o z 



+ 1-8 = 



+ 1-8. 



(14) 



where — 1 < p < 1 is the Pearson correlation coefficient between the 
susceptibility /?< and the square of the degree k t of individuals, (/?) and 
(fc 2 ) are their respective averages, and a« and <jja are their respective 
standard deviations. Looking at equation (14), we point out that even 
small correlations between the susceptibility and the degree can lead 



to significant over- or underestimation of R 0 when the variation in 
connectivity, as measured by op, is large compared to the average 
connectivity (k), which is the for networks following a heterogeneous 
degree distribution. 

Networks with non-trivial mixing patterns. We now focus on link 
level correlations, where the tendency of individuals to connect to 
other individuals with similar characteristics leads to non-trivial 
mixing patterns. We note that the same mathematical procedure 
can be used for the analysis regardless of whether we have 
correlations in susceptibility and constant recovery probability or 
vice versa. We consider link level correlations in the degree and 
the susceptibility of nodes. Then we approximate r y - with (r,y), by 
using the expected number of links from node with degree k, and 
susceptibility [i, to node with degree kj and susceptibility fiy The 
expected number of links are proportional to the two-point 
conditional probability P(k', fS'\k, ft): 



r,r{r,,)-- 



NPlkjJ, 



(15) 



To find the maximal eigenvalue of the matrix R as defined by 
equation (15) we need to find a positive vector v = [v ; ] such that 



N 



p u 



(16) 



holds for all i. Using some algebraic manipulations, we can rewrite 
equation (16): 



N 



E 



N 



E 



p k 



MM) 



P{k',0) 



E 



(17) 



which holds for every i. Writing j,fa=ff p =i3' v j 
ming these N equations over i where k t = k and ji 

N 



-■ Vfj pi and sum- 
= /?, we have 



Vk,ph,R 



E 



i,ki = k,p, = p 



v P(fc'/|fc„/j,) 

^ WJ) « J 



(18) 



k'.H' 



which holds for every combination of k and [3. We can simplify 
equation (18) further by writing Vk,p = P(k,P)vk,p- 

h,ph,R = E WP{k',l)'\k,fi)v UJi = A UD V k ,p. (19) 
k'.P' 

where X 1:D is the maximal eigenvalue of the matrix D\k,p\, {k',p'\ = 
kpP(k', P'\k, fi). In other words, the matrices R and D share the same 
maximal eigenvalue. So, we will use \ liD as an approximation of the 
actual R 0 when there is limited information in the network: 



Ro=Xi# + l-8~k l!D + l-S. 



(20) 



Assuming d=|{fc, [3}\ is the number of different combinations of 
degree (k) and susceptibility (/?) that a node can have in the network, 
we have compressed the entire information about the system (net- 
work topology and the susceptibilities of nodes) into a coarsened d X 
d matrix. In fact, we can choose d depending on how much informa- 
tion we have about the network. For example, we can assign each 
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individual to one of 5 degree classes, k = 1, 2, 4, 8, 16 and to one of 3 
susceptibility classes, p = Pi ow , P avg , [thigh corresponding to low, 
average and high susceptibility respectively. We then only need to 
estimate the mixing patterns P(k', P'\k, P) between the 15 classes of 
individuals, as opposed to knowing every entry in the matrix R. The 
more classes we have, the more mixing patterns we have to estimate 
and the closer ?^ jj will be to the actual 

Moreover, if we assume independence between the degree of a 
node and its susceptibility, i.e. P(fc', fi'\k, p) = P(k'\k)P(P'\p) and 

choose v^/j = vj}.V|j where v 1 = [v£] and v 2 = jv^J are the positive 

eigenvectors corresponding to the maximal eigenvalues l l>c and 
I i t B of the matrices Cm = kP(k'\k) andBpp' = pP(p'\P) respectively, 
we can simplify equation (19): 

= E kp (k'\k)vl E Wl^H = KcKbv^ (21) 

which gives 2^ = Ai,c^i,b an d solves the system of equations. Note 
that when there is no degree mixing, l l>c = (k 2 ) I (k). On the other 
hand, when there are no mixing patterns in susceptibility, = (p). 

Equation (19) allows us to estimate the basic reproductive number 
at different granularity depending on how much information we 
have about the network topology and the distribution of susceptibil- 
ities. However, from the scientific perspective it is still unclear how 
the different mixing patterns impact the spread of diseases. To push 
our understanding of the effect of differential susceptibility further 
we study the effects of segregation in the spread of diseases. 

Segregation. Since the contact networks that underlie the spread of a 
disease are not only heterogeneous in terms of degree and 
susceptibility, but also segregated, we next proceed to solve the 
model for the cases where individuals are more likely to connect 
with others a similar level of susceptibility. As we will see, 
segregation can have an impact in the spreading of a disease that 
goes beyond the effects of differential susceptibility. 

To understand the effect of segregation, we focus on the case where 
the degree of a node is independent of its susceptibility, a 1>r = 
^•i,c^i3> an d look respectively, at the limiting cases when there is 
no segregation and maximal segregation. Note that in the case of 
no segregation P(p'\P) = P(fS') and the largest eigenvalue of £ is 2 1>B 
= (/J) (with eigenvector v, Vp = /i), hence the basic reproductive 
number takes the form 



Ro = 08)Ai,c+l-<5- 



(22) 



On the other hand, in the case of maximal segregation, where nodes 
only share links with nodes with the same susceptibility, B is diagonal 
with p's as the diagonal elements. In this case, is equal to the 
largest susceptibility in the system P max . Thus, in the case of maximal 
segregation, the basic reproductive number becomes 



«o=ft«Ac+i-a- 



(23) 



We note that in this extreme case nodes do not share links outside 
their susceptibility class, so the network is made of disconnected 
components. Hence, R 0 > 1 implies a persistent infection in at least 
one of these components. We do want to stress that in general R 0 > 1 
does not guarantee a macroscopic outbreak in the network, but the 
existence of a highly vulnerable pocket, or subgraph, where R 0 is 
above 1. In this case, the epidemic will be persistent, but will most 
likely remain contained. 

To obtain general bounds for _Rj, orr = X\ ,c^i,b + 1 — <5> we use the 
Collatz-Wielandt formula 20 , which states that the Perron root of a 
matrix A is given by r = max vefJ /(v), where 

/(v) = min — ^ a 'i v i and = {v| v > 0 with v # 0} . (24) 



Similarly, the min-max version states that r = min ve£J /(v) where 

f(v) = max — anVj. (25) 
i 

We rewrite the min-max and max-min versions of the Collatz- 
Wielandt formula into a form that we will use in the rest of the paper 



min — ajjVj <r< max — cuiVt 



(26) 



which holds for all non-negative non-zero vectors v. Assuming the 
matrix B is irreducible, the Perron root of B coincides with the largest 
eigenvalue /t l fl . Thus, using equation (26), we can bound 1 1>B 



min -V P(/?'| 



■<hj< max^Vp(/f'| 



(27) 



where v = [vp] can be any non-negative vector. Choosing the vector 
v such that Vp = p in equation (27), we immediately obtain the 
bound: 



).\ c min 

9 



-l-8<R 0 <Z hC max{f3\p') 

0 



■1-8 (28) 



where (P\P') denotes the average susceptibility of the individuals 
connected to an individual with susceptibility ft' . These bounds pro- 
vide useful information to understand the potential impact of a dis- 
ease under limited knowledge about the network topology. We note 
that weak segregation limits the variation of (P\P') across the differ- 
ent susceptibility classes bounding R 0 in a small region around (p). 
Strong segregation on the other hand, increases the gap between the 
bounds and our uncertainty for R 0 . 

To extend our intuition further we consider a simple network 
model with tunable segregation. Here, each node has the same degree 
{k)andP{R'\P) = (1 - s)/N fi and P(f}\R) = (1 - s)/N, s + swhereN^ 
is the number of different susceptibility classes and 5 6 [0, 1] models 
the segregation in the network. When s = 0, P(/3'|/?) = l/Na = 
P(P\P) and the chances that a node will connect to others is inde- 
pendent of the susceptibility p. When, 5=1, segregation is maximal 
(P(p\p) = 1 and P(P'\P) = 0) and nodes share links only with others 
that have the same susceptibility. In this model, R 0 = Xi^-i,b + 1 ^ <5 
= (k) X l B + 1 — 5 since all nodes have the same degree (A:). To derive 
bounds for R 0 in this segregation model, we note that P(p'\p) = 
(1 — s)/Nfj + Apfi'S where A««' is the Kronecker delta. Putting this 
into equation (27) we have 



i < max 



'Ev+^- 



(29) 



Then choosing Vp = PP(P) and using the fact that P(P) = 1/Np we 
have 



1 UB < max( 1 - 5) </J> +sp = (l-s) (P) + sp max . 
On the other hand, the lower bound for a 1>b has the form 



(30) 



(31) 



9 



where again v = [vp] can be any non-negative vector. Note that if 
we choose Vp = 1 if p = p max and 0 otherwise, we get the lower 
bound 

P max [^s)/N fi +s]<l ltB . (32) 
Thus, we obtain the bounds for the basic reproductive number: 



1-s 



+ s 



P max + 1 - <5 < R 0 < (k) [( 1 - s) (P) +sp max ] + 1-5.(33) 
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100 



Figure 1 | Impact of segregation on R 0 . (a) Rq = X^b (fc) + 1 — 8 as a function of the amount of segregation s (solid line) along with the bounds 
given by equation (33) (dashed lines) with Np = 5 susceptibility classes uniformly spaced between [0.005, 0.025] with (fi) = 0.015. Additionally, to 
compute an actual value for Rq a degree k = 25 and S = 0.5 was assumed, (b) Rq = A 1-R (solid line) and the approximation R 0 ~ a 1?d (dashed line) for the 
face-to-face proximity school network for different iterations of the segregation process. 



a) No segregation b) Mild segregation 




Figure 2 | The endemic state for different level of segregations and different susceptibility, (a), (b) and (c) face-to-face proximity school network 
with no, mild and high segregation respectively. High susceptibility nodes are colored red. d) The fraction of infected nodes in the endemic state averaged 
over 10,000 runs as a function of the susceptibility /?! for the school network with three levels of segregation, along with the critical values of fl 1 for which 
Ro = /t u; + 1 — (5 = 1 (inset) denoted as horizontal dashed lines. 
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Fig. la shows the basic reproductive number _R 0 as a function of the 
amount of segregation s (solid line) along with the bounds given by 
equation (33) (dashed lines). We observe that as the amount of 
segregation increases, the network develops a highly susceptible 
pocket and _R 0 tends to (A:) p max +1 — 8, This increase in vulner- 
ability comes from highly susceptible groups of individuals that pro- 
vide a stable pocket supporting the infection. The results show that 
segregation makes the network of individuals only as strong as its 
weakest subgroup. This suggests an immunization strategy that seeks 
to identify and target clusters of highly susceptible individuals, 
instead of groups of highly susceptible individuals that are connected 
to others that are less susceptible. 

To illustrate the effects of segregation, we look at a real-world 
contact network of face-to-face proximity between students and tea- 
chers in a primary school 18 . Links between A and B denote the 
cumulative time spent by A and B in face-to-face proximity, over 
one day. For simplicity, we convert this daily network to an 
unweighted network by assigning a link between the pair of nodes 
that have spent more than 2 minutes of cumulative time in face-to- 
face proximity. After discretizing the links, we keep the largest con- 
nected component which consists of 225 nodes. We then run a 
variation of the Schelling's segregation process 19 where initially each 
node is randomly assigned to one of two susceptibility classes. We use 
<5 = 0.5 and assign high susceptibility nodes p hjgh = and low 
susceptibility nodes with /? = PJIO, thus having only one parameter 
in the network, j8i. To segregate this initial network, we assign to each 
node i a potential energy — (/?),| where (/?), is the average sus- 
ceptibility of ;"s neighboring nodes. At each iteration, we swap a 
random pair of nodes if this decreases the total potential energy in 
the network. With each new iteration, we conserve the distribution of 
total susceptibility in the network but increase the level of segrega- 
tion. Fig. lb shows the increase of R 0 = A 1:R (solid line) with iterations 
of Schelling's segregation process. The dashed line on the other hand 
shows the approximation R 0 ~ a 1 d when there is limited informa- 
tion in the network. The matrix D is constructed by coarse-graining 
the network into 7 degree classes k — {1, 2, 4, 8, 16, 32, 64} where each 
node is assigned to its nearest degree class and two susceptibility 
classes ft = fiJlO}. It is worth mentioning that we compute 
i-i,D by estimating only 14 2 entries of the matrix D, as opposed to 
the 225 2 entries of the matrix _R, which is a significant reduction in the 
amount of information we need about the system. 

Fig. 2a, 2b and 2c show the primary school network at three 
different iterations in the segregation process; no segregation, mild 
segregation and high segregation respectively, with high susceptibil- 
ity nodes colored red. Fig. 2d shows the average fraction of infected 
nodes in the endemic state, p, averaged over 10, 000 simulations, for 
different values of Pi for the three networks shown in Fig. 2a, b and c. 
As we mentioned earlier, and as can be seen in Fig. 2d, segregation 
can change the shape of the epidemic curve and depending on the 
network topology, high R 0 does not necessarily lead to a high number 
of infected nodes in the endemic state. This can happen when there is 
extreme segregation and the infection is contained within a small 
number of highly susceptibility nodes that have no or little contact to 
the rest of the network. In practice, however, we often observe mild 
segregation which increases both _R 0 and the number of infected 
nodes in the endemic state. Finally, we compute the critical values 
of pi using equation (11). The inset of Fig. 2d shows p near these 
critical values along with the critical values (horizontal dashed lines). 

Discussion 

In this paper, we extended the SIS model to incorporate heterogen- 
eous susceptibility and showed that heterogeneity can significantly 
increase the networks' vulnerability to diseases. For individual level 
correlations between the susceptibility and the degree of a node, we 
find that approaches using the average susceptibility of the system to 
approximate R 0 will over- or underestimate the potential spread of 



diseases. In other words, when there is variation in susceptibility, the 
increase in susceptibility of a few individuals is not necessarily com- 
pensated by the decrease in susceptibility of others, since the degrees 
and locations of these individuals play an important role. On the other 
hand, we found that when nodes with similar characteristics are more 
likely to be connected, the vulnerability of the network increases, since 
a small group of densely connected high-susceptible individuals can 
act as a pocket supporting a persistent infection. Having a formula to 
compute R 0 , however, has little practical use in absence of complete 
information about the network topology and the distribution of sus- 
ceptibilities. In this case, we provided a method to approximate R 0 by 
coarse-graining the individual nodes into degree-susceptibility classes 
and only estimating the mixing patterns between these classes. Going 
forward, it is important that mathematical models of epidemic spread- 
ing include the effects of heterogeneous susceptibility to provide more 
accurate descriptions of epidemic spreading processes. Otherwise, the 
basic reproductive numbers estimated from compartmental models 
will be systematically over or underestimated. 
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